An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets

Authors

  • Daisuke Tani
  • Yamato Ohtani
  • Tomoki Toda
  • Hiroshi Saruwatari
  • Kiyohiro Shikano
Abstract

This paper describes an evaluation of many-to-one voice conversion (VC) algorithms converting an arbitrary speaker’s voice into a particular target speaker’s voice. These algorithms effectively generate a conversion model for a new source speaker using multiple parallel data sets of many pre-stored source speakers and the single target speaker. We conducted experimental evaluations to demonstrate the conversion performance of each of the many-to-one VC algorithms, including not only the conventional algorithms based on a speaker-independent GMM and on eigenvoice conversion (EVC), but also new algorithms based on speaker selection and on EVC with speaker adaptive training (SAT). The results show that adapting the conversion model significantly improves conversion performance, and that the algorithm based on speaker selection works well even with a very limited amount of adaptation data.


Similar resources

Doctoral Thesis Techniques for Improving Voice Conversion Based on Eigenvoices

Voice conversion (VC) is a technique for converting a source speaker’s voice into another speaker’s voice without changing linguistic information. As a typical approach to VC, a statistical method based on a Gaussian mixture model (GMM) is widely used. A GMM is trained as a conversion model using a parallel data set composed of many utterance pairs of source and target speakers. Although this fra...


Many-to-many eigenvoice conversion with reference voice

In this paper, we propose many-to-many voice conversion (VC) techniques to convert an arbitrary source speaker’s voice into an arbitrary target speaker’s voice. We have proposed one-to-many eigenvoice conversion (EVC) and many-to-one EVC. In the EVC, an eigenvoice Gaussian mixture model (EV-GMM) is trained in advance using multiple parallel data sets of a reference speaker and many pre-stored sp...


An improved one-to-many eigenvoice conversion system

We have previously developed a one-to-many eigenvoice conversion (EVC) system enabling the conversion from a specific source speaker’s voice into an arbitrary target speaker’s voice. In this system, an eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets composed of utterance pairs of the source and many pre-stored target speakers. The EV-GMM is effecti...


Regression approaches to voice quality control based on one-to-many eigenvoice conversion

This paper proposes techniques for flexibly controlling voice quality of converted speech from a particular source speaker based on one-to-many eigenvoice conversion (EVC). EVC realizes a voice quality control based on the manipulation of a small number of parameters, i.e., weights for eigenvectors, of an eigenvoice Gaussian mixture model (EV-GMM), which is trained with multiple parallel data s...


Cross-language voice conversion based on eigenvoices

This paper presents a novel cross-language voice conversion (VC) method based on eigenvoice conversion (EVC). Cross-language VC is a technique for converting voice quality between two speakers who speak different languages. In general, parallel data consisting of utterance pairs of those two speakers are not available. To deal with this problem, we apply EVC to cross-language VC. First...




Publication date: 2007